Model Selection with the Loss Rank Principle
Authors
Abstract
A key issue in statistics and machine learning is to automatically select the “right” model complexity, e.g., the number of neighbors to be averaged over in k nearest neighbor (kNN) regression or the polynomial degree in regression with polynomials. We suggest a novel principle, the Loss Rank Principle (LoRP), for model selection in regression and classification. It is based on the loss rank, which counts how many other (fictitious) data would be fitted better. LoRP selects the model that has minimal loss rank. Unlike most penalized maximum likelihood variants (AIC, BIC, MDL), LoRP depends only on the regression functions and the loss function. It works without a stochastic noise model and is directly applicable to any non-parametric regressor, such as kNN.
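The counting idea in the abstract can be illustrated with a brute-force sketch for kNN on a tiny data set with binary labels. This is not the authors' implementation: the function names, the squared loss, the binary label set, the inclusion of each point among its own neighbors, and the choice to count ties (and the observed data itself) are all assumptions made for illustration. Exhaustive enumeration is only feasible for very small n.

```python
from itertools import product


def knn_fit(xs, ys, k):
    """Predict each training point as the mean of its k nearest neighbors.

    Assumption: a point counts as its own nearest neighbor, and distance
    ties are broken by index.
    """
    preds = []
    for x in xs:
        order = sorted(range(len(xs)), key=lambda j: (abs(xs[j] - x), j))
        preds.append(sum(ys[j] for j in order[:k]) / k)
    return preds


def empirical_loss(xs, ys, k):
    """Squared-error loss of the kNN fit on its own training data."""
    return sum((y - p) ** 2 for y, p in zip(ys, knn_fit(xs, ys, k)))


def loss_rank(xs, ys, k, label_set=(0, 1)):
    """Count the fictitious label vectors that kNN fits at least as well
    as the observed one (ties and the observed vector are counted)."""
    observed = empirical_loss(xs, ys, k)
    return sum(
        1
        for fake in product(label_set, repeat=len(ys))
        if empirical_loss(xs, fake, k) <= observed + 1e-12
    )


def select_k(xs, ys, candidates):
    """Return the candidate k with minimal loss rank, plus all ranks."""
    ranks = {k: loss_rank(xs, ys, k) for k in candidates}
    return min(ranks, key=ranks.get), ranks


# Two well-separated clusters with constant labels inside each cluster.
best_k, ranks = select_k([0, 1, 2, 10, 11, 12], [0, 0, 0, 1, 1, 1], [1, 3, 6])
print(best_k, ranks)
```

In this toy example, k = 1 fits every possible labeling perfectly, so all 2^6 = 64 labelings are counted and the rank is maximal. The intermediate choice k = 3 fits only the labelings that are constant within each cluster, which gives it the smallest rank.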
Similar resources
Model Selection by Loss Rank for Classification and Unsupervised Learning
Hutter (2007) recently introduced the loss rank principle (LoRP) as a general-purpose principle for model selection. The LoRP enjoys many attractive properties and deserves further investigation. The LoRP has been well studied for the regression framework in Hutter and Tran (2010). In this paper, we study the LoRP for the classification framework, and develop it further for model selection problems in ...
The Loss Rank Principle for Model Selection
We introduce a new principle for model selection in regression and classification. Many regression models are controlled by some smoothness or flexibility or complexity parameter c, e.g. the number of neighbors to be averaged over in k nearest neighbor (kNN) regression or the polynomial degree in regression with polynomials. Let $\hat{f}_D^c$ be the (best) regressor of complexity c on data D. A more fl...
A Multiple Objective Nonlinear Programming Model for Site Selection of the Facilities Based on the Passive Defense Principles
One of the main principles of the passive defense is the principle of site selection. In this paper, we propose a multiple objective nonlinear programming model that considers the principle of the site selection in terms of two qualitative and quantitative aspects. The purpose of the proposed model is selection of the place of facilities of a system in which not only it observes the dispersion ...
Using the Hybrid GA-TOPSIS Algorithm to Solving the Site Selection Problem in Passive Defense
One of the main principles of the passive defense is the principle of site selection. In this paper, we propose a multiple objective nonlinear programming model that considers the principle of the site selection in terms of two qualitative and quantitative aspects. The purpose of the proposed model is selection of the place of key production facilities of a system in which not only it observes ...
MML Invariant Linear Regression
This paper derives two new information theoretic linear regression criteria based on the minimum message length principle. Both criteria are invariant to full rank affine transformations of the design matrix and yield estimates that are minimax with respect to squared error loss. The new criteria are compared against state of the art information theoretic model selection criteria on both real a...
Journal title:
- Computational Statistics & Data Analysis
Volume 54
Publication date: 2010